Analysis of Complexities for finding efficient Association Rule Mining Algorithms

Authors

  • R. Rathinasabapathy
  • R. Bhaskaran
Abstract

Several algorithms for association rule mining have been implemented, including a variation of Apriori; an algorithm that uses hash functions to find large 2-itemsets and 3-itemsets and a direct search method to find the other large k-itemsets; and a variation of the Eclat algorithm that uses perfect hash functions for 2-itemsets and 3-itemsets and vertical mining to find the other large k-itemsets. These algorithms were compared with one another by deriving the time complexity of each, in order to identify the factors that affect their efficiency. By examining the complexities, we are able to (i) identify the factors that influence execution time, (ii) explain why one algorithm runs faster than another, and (iii) arrive at a sound method for choosing the best algorithm under given constraints such as database composition, maximum frequent set size, and itemset size.

Keywords: Clustering, classification, and association rules; Data mining; Analysis of algorithms; Graph theory.
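The vertical mining approach named in the abstract can be illustrated with a minimal sketch. It assumes the plain Eclat idea only: each item is mapped to the set of transaction ids that contain it, and larger itemsets are grown by intersecting those sets. The function name `eclat`, the sample transactions, and the absolute support threshold are hypothetical choices for illustration; this is not the authors' variant, which additionally uses perfect hash functions for 2-itemsets and 3-itemsets.

```python
from collections import defaultdict


def eclat(transactions, min_support):
    """Return {frozenset(itemset): support_count} for every frequent itemset.

    Illustrative Eclat-style sketch (an assumption, not the paper's code).
    min_support is an absolute count of transactions.
    """
    # Vertical layout: item -> set of ids of the transactions containing it.
    tidsets = defaultdict(set)
    for tid, items in enumerate(transactions):
        for item in items:
            tidsets[item].add(tid)

    frequent = {}

    def extend(prefix, candidates):
        # candidates: (item, tidset) pairs that are frequent together with prefix.
        for i, (item, tids) in enumerate(candidates):
            itemset = prefix | {item}
            frequent[itemset] = len(tids)
            # Grow the itemset by intersecting tidsets with the later candidates.
            next_candidates = []
            for other, other_tids in candidates[i + 1:]:
                shared = tids & other_tids
                if len(shared) >= min_support:
                    next_candidates.append((other, shared))
            if next_candidates:
                extend(itemset, next_candidates)

    # Start from the frequent single items, in a fixed order.
    initial = sorted(
        ((item, tids) for item, tids in tidsets.items() if len(tids) >= min_support),
        key=lambda pair: pair[0],
    )
    extend(frozenset(), initial)
    return frequent


# Hypothetical sample data: five transactions over the items a, b, c.
sample = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
result = eclat(sample, min_support=3)
```

In this sketch the support of an itemset is simply the size of its intersected transaction-id set, so no rescan of the database is needed once the vertical layout is built. That property is the usual reason vertical methods are compared against Apriori-style candidate counting.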


Similar resources

A new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining

Data envelopment analysis (DEA) is a relatively new data-oriented approach for evaluating the performance of a set of peer entities, called decision-making units (DMUs), that convert multiple inputs into multiple outputs. Within a relatively limited period, DEA has become a strong quantitative and analytical tool for measuring and evaluating performance. In an article written by Toloo et al. (2009...


Optimizing Membership Functions using Learning Automata for Fuzzy Association Rule Mining

Transactions in web data often consist of quantitative data, suggesting that fuzzy set theory can be used to represent such data. The time spent by users on each web page, one type of web data, is modeled as a trapezoidal membership function (TMF) and can be used to evaluate user browsing behavior. The quality of mining fuzzy association rules depends on membership functions and since t...


New Approaches to Analyze Gasoline Rationing

In this paper, the relations among factors in the road transportation sector from March 2005 to March 2011 are analyzed. Most previous studies take an economic point of view on gasoline consumption. Here, a new approach is proposed in which different data mining techniques are used to extract meaningful relations between the aforementioned factors. The main and dependent factor is gasolin...


Efficient adaptive frequent pattern mining techniques for market analysis in sequential and parallel systems

The classical applications of Association Rule Mining (ARM) are market analysis, network traffic analysis, and web log analysis, where strategic decisions are made by analyzing the frequent itemsets from a large pool of data. Datasets in such domains are constantly updated, and as such they require an efficient Frequent Pattern Mining (FPM) algorithm which is capable of extracting the required informa...


Exploiting Parallelism in Association Rule Mining Algorithms

Association rule mining, one of the major techniques of data mining, involves finding frequent itemsets with minimum support and generating association rules among them with minimum confidence. The task of finding all frequent itemsets for a large dataset requires a lot of computation, which can be reduced by exploiting parallelism in the sequential algorithms. In this paper, we provide th...




Publication date: 2011